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© Process for the production of protein products in Aspergillus oryzae and a promoter for use In 
Aspergillus. 

© A process for expression of a protein product in Aspergillus oryzae is disclosed. The process comprises 
transforming Aspergillus oryzae with a vector system comprising DNA-sequences encoding functions facilitating 
gene expression, a suitable marker for selection of transfbrmants, and a DNA-sequence encoding the desired 
protein product The process enables industrial production of many different polypeptides and proteins in A . 
oryzae. Examples of such products are chymosin or prochymosin and other rennets, proteases, lipases and 
amylases. Also disclosed is an effective promoter for expression of a protein in Aspergillus . A preferred 
promoter is the TAKA-amylase promoter or functional parts thereof. 
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PROCESS FOR THE PRODUCTION OF PROTEIN PRODUCTS IN ASPERGILLUS ORYZAE AND A PRO- 
MOTER FOR USE IN ASPERGILLUS 



BACKGROUND OF THE INVENTION 

The present Invention relates to a process for expression of protein products in Aspergillus oryzae . 
recombinant DNA vectors, a promoter for Aspergillus and transformed fungi. ^ — — 

In the past, numerous processes have been developed for the production of polypeptides or proteins by 
means of the recombinant DNA technology. The mam interest has been concentrated on bacteria and 
yeast, e.g. E. aofi. Bacillus subtilis and Saccharomyces cerevisiae being well characterized species as 
regards for instance expression and selection systems. 

Besides the above mentioned microorganisms, filamentous fungi, such as Aspergillus niger , are 
attractive candidates as host microorganisms for recombinant DNA vectors being well-characterized and 
widely used microorganisms for the commercial production of enzymes. Efforts have especially been 
concentrated on the development of transformation systems by which a selection marker permitting 
selection of transformants from the untran stormed host microorganisms is used. 

In the last few years different selection markers for the transformation of Aspergillus nidulans have 
been described and procedures have recently been developed for integrative transformation of the 
filamentous fungus Aspergiilus nidulans for the purpose of investigation of the genetic and molecular 
processes controlling fungal cell differentiation. 

Transformation of A. nidulans has been demonstrated by using piasmids containing the Neurospora 
crassa Pyr-4 gene (Ballahce, DJ. et a)., Biochem.Biophys.Res.Commun., 112 (1983) 284-289), the A. 
nidulans amdS gene (Tilbum. J. G. et al. v Gene 26 (1983) 205-221), the A. nidulans trpC gene (Yelton. 
M.M. et al., Proc.NatI.Acad.ScL U.S.A., §1 (1984) 1470-1474) and the A. nidulans argB gene (John, M.A. 
and Peberdy J., Microb.TechnoI. 6 (1984) 386-389). The trans forming DNA was found to be Integrated into 
the host genome at rather low frequencies (typically < 1000 transform ants/ug of DNA). 

Very recently transformation of Aspergillus niger with the amdS gene of A . nidulans was described - 
(Kelly, J.M. and Hynes, M.J., EMBO Journal 4 (1985), 475-479) and amdS was shown to be a potential 
selection marker for use in transformation of Aspergillus niger that cannot grow strongly on acetamide as a 
sole nitrogen source. Transformation of Aspergillus niger using the argB gene of Aspergillus nidulans has 
also been described recently (Buxton, F. P. et a!., Gene 37 (1985), 207-214). 

So far no systems have been developed for expression of foreign proteins in the filamentous fungi 
Aspergillus oryzae mainly due to insufficient knowledge of how to control gene expression in this fungus 
and due to the lack of suitable selectable genetic markers on cloning vectors. 



BRIEF DESCRIPTION OF THE INVENTION 

According to the present invention it has now been shown that the above transformation techniques can 
be used to obtain a high level of expression of heterologous proteins or to enhance the production of 
homologous proteins in Aspergillus oryzae . 

As used herein the expression "heterologous proteins" means proteins not produced byA. oryzae 
whereas "homologous proteins" means proteins produced byA. oryzae itself. ~ — ~— 

More specifically it has been shown that selection for A. oryzae strains transformed with DNA encoding 
a desired protein product is possible by use of the markergenes used for transformation of A. niger and A . 
nidulans. Due to the phylogenetic distance between these latter fungi andA. oryzae (Raper, K.B. and Fennel. 
D.I. (1965) The Genus Aspergillus) this was by no means to be foreseen. 

According to a first aspect of the present invention there is provided a process for expression of a 
protein product in Aspergillus oryzae comprising the steps of: 

(a) providing a recombinant DNA cloning vector system capable of integration into the genome of an 
Aspergillus oryzae host in one or more copies and comprising: DNA-sequences encoding functions 
facilitating gene expression; a suitable marker for selection of transformants; and a DNA-sequence encoding 
the desired protein product; 

(b) transforming the Aspergillus oryzae host which does not harbour a functional gene for the chosen 
selection marker with the recombinant DNA cloning vector system from step a; and 

(c) culturing the transformed Aspergillus oryzae host in a suitable culture medium. 
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According to a second aspect of the present invention there is provided a highly effective promoter for 
expression of a protein product in Aspergillus , especially in Aspergillus oryzae and Aspergillus niger. which 
promoter is characterized as being the TAKA-amylase promoter or functional parts thereof optionally 
preceded by upstream activating sequences. 

According to a third aspect of the present invention there is provided a method for production or a 
protein product inAspergillus oryzae by which method an Aspergillus oryzae strain being transformed with 
a recombinant DNA cloning vector system as described above is cultured in a suitable culture medium and 
the product is recovered from the culture medium. 
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BRIEF DESCRIPTION OF THE DRAWINGS: 



The present invention is further illustrated by reference to the accompanying drawings in which: 

Fig. 1 shows the DNA-sequence of the TAKA-amylase promoter and upstream promoter regions, the 
15 preregion and the 5'part of the structural gene for the TAKA-amylase. 

Hg. 2 shows an endonuclease restriction map of pldsmid pTAKA17, 
Fig. 3 illustrates the construction of plasmid p285'proC ? 

Ra 4a and b shows the DNA sequence of prepro Rhizomucor miehei aspartic proteinase cDNA 
together with the deduced amino acid sequence given by three-letter abbreviations. 
20 Fig. 5 illustrates the construction of plasmid pRMP, 

Fig. I shows the endonuclease restriction map of plasmid pCAMG91 

Fig. 7a illustrates the construction of plasmid pICAMG/Term, 

Fig. 7b illustrates the construction of plasmid p686 

Fig. 8 illustrates the construction of plasmid pRMPAMGTerm, 
26 Fig. 9a illustrates the construction of plasmid pB408.3, 

Fig. 9b illustrates the construction of plasmid p778, 

Rg. 10 illustrates the construction of plasmid p719 f 

Fia. 11 illustrates the construction of plasmid p777, 

SoTlhe sequence of prepro Rhizomucor miehei lipase cDNA together wrth the deduced 

30 amino acid sequence given by three-letter abbreviations. 

Rg. 13a illustrates the construction of plasmid plasmid pB544, • 

Fig. 13b illustrates the construction of plasmid p787, 

Fig. 14 shows the DNA sequence of a synthetic fragment RML5', 

Fig. 15a illustrates the construction of plasmid pTOC51 and 
as Fig. 15b illustrates the construction of plasmid pTOC56. 
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DETAILED DESCRIPTION OF THE INVENTION 

The transformation technique used was a method adapted from the methods for tra^formaton of A. 
nidulans (Ballance et a!. Biochem.Biophys.Res.Commun.. 112 (1983). 284-289; Tilburn et ah Gene 26 - 
&-205-221. Yelton et a,. PrTO .NatLAcad.Sci. USA. 81 (1984) 1470-1474) and similar to fte meW ^of 
Buxton et al. (Gene 37 (1985). 207-214) for transformation of A, niger. In the process of the present 
fnvention Aspergillus &za* is transformed with a vector system containing a se.ect.on marker which* 
capable of being Inc oTp^aTsd into the genome of the host strain, but which is not harbourec im ^*host 
stSin before the transformation. Transformarrts can then be selected and isolated from nontransformants on 
the basis of the incorporated selection marker. 

Preferred selection markers are the argB (A. nidulans or A. nig^) trpC ^J^^^^ 
nidulans) . or pyr4 (Neurospora crassa) genes, or the DHFR (dihydrofolate reductase or mutants hereoO 
gl^iore pnJerredlelecln- =eTs are the argB or the amdS gene Wild type J ;^*™»« 
normally argB* (i.e. the argB gene is functional in A. oryzae). If argB is chosen as the M Jff ™ 
JgB mutanl strain of A . oryzae which has a defect in the gene for this ^ mu ^^^^; 
A. oryzae argB mutante c^nbTprepared as described by F.P. Buxton et al. (Qene 37 (1985). 207-2l4^An 
a7gB^utent is defined as a mutant having a defect in the omithin transcarbarny lase gene. Onthe other 
hand the amdS gene may be used as selection marker for the transformation of wild type A. oryzae as the 

wild type strains do not contain this gene. . 

DMA-sequences encoding functions facilitating gene expression are typically promoters, transcnption 

terminators and polyadenylation signals. 
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The promoter, which might be preceded by upstream activating sequences and enhancer sequences as 
well known in the art. may be any DNA-sequence that might show strong transcriptional activity in 
As P er 8 Hlus 2D£S£ a™ 1 ">ay be derived from genes encoding both extracellular and intracellular proteins 
such as amylases, glucoamylases. proteases. lipases, cellulases and .glycolytic enzymes. Suitable promot- 
ers may be derived from genes for A. oryzae TAKA amylase. Rhizomucor miehei aspartic proteinase. A. 
njger glucoamylase. A. njger neutral a-amylase, A. nicer acid stable a-amylase. and Rhizomucor miehei 
lipase. Examples of promoters from genes for glycolytic enzymes are TPI, ADH and PGK. 

A preferred promoter according to the present Invention is the A. oryzae TAKA-amylase promoter The 
TAKA amylase is a well known a-amylase (Toda et ai.. ProcJapan"Acad. 58 Ser. B. (1982) 208-212) DNA 
encoding the promoter region was derived from the~TAKA-amylase genomic clone. The sequence of the 
promoter and regions upstream to the promoter together with the preregion and the 5'end of the structural 
gene for the TAKA-amylase illustrated in Rg.1. 

As described in further detail in example 2 a DNA-sequence encoding the TAKA-amylase including the 
preregion and promoter and upstream activating sequences was derived from a A oryzae mycelium and 
inserted in BamHI digested pBR322 to give plasmid pTAKA 17 (see Rg. 2). In pTAKA17 the A. oryzae 
derived DNA is shown as a 5.5 kb BamHI/Sau 3AI -BamHl/Sau 3AI fragment the promoter artd upsfream" 
activating sequences representing a 2.1 kb fragment starting at position O. The established DNA-sequence 
of the promoter and upstream activating sequences up to the Bglll site is shown in Rg. 1. The promoter 
ends at nucleotide-1 preceding the Met(1) codon of the TAKA-amylase presequence. The nucleotide 
20 sequence encoding the presequence is constituted of 63 nucleotides and the mature TAKA-amylase starts' 
at a position corresponding to nucleotide 64. 

From pTAKA 17 the whole promoter sequence including sequences upstream to the promoter or 
functional parts thereof may be derived by means evident to the person skilled in the art The promoter 
sequence may be provided with linkers with the purpose of introducing specific restriction sites facilitating 
the ligation of the promoter sequence with further DNA. for instance the gene encoding the desired protein 
product or different preregions (signal peptides). 

In the method according to the present invention the sequence from nucleotide -1144 (see Rg 1) - 
(representing the start of a Sail site) to nucleotide -10 has been used as one example of a well functioning 
part of the promoter region. In another embodiment of the present invention the nucleotide sequence from 
nucleotide -1176 to -1 was preceded by the still not sequenced 1.05 kb fragment from pTAKA 17 It is 
evident for the person skilled in the art that different fragments can be used. 

According to one embodiment of the present invention the promoter and upstream activating sequences 
have the following sequence or a functionally equivalent nucleotide sequence: 

GTCGACGC ATTCC6AATA CGAGGCCTGA TTAATGATTA CATACGCCTC CGGGTAGTAG ACCGAG- 

3S CAGC CGAGCCAGTT CAGCGCCTAA AACGCCTTAT AC AATTAAG C AGTTAAAGAA GTTAGAATCT ACG- 
CTTAAAA AGCTACTTAA AAATC6ATCT CGCAGTCCCG ATTCGCCTAT CAAAACCAGT TTAAATCAAC 
TGATTAAAGG TGCCGAACGA GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC 
AATATCAGGC CGCGCACGAA AGGCAACTTA AAAAGCGAAA GCGCTCTACT AAACAGATTA 
CTTTTGAAAA AGGCACATCA GTATTTAAAG CCCGAATCCT TATTAAGCGC CGAAATCAGG 

40 CAGATAAAGC CATACAGGCA GATAGACCTC TACCTATTAA ATCGGCTTCT AGGCGCGCTC CATC- 
TAAATQ TTCTQeCTGT GGTGTACAGG GGCATAAAAT TACGCACTAC CCGAATCGAT AGAACTACTC 
ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT TTTAAATTTT TATATGGCGG 
GTGGTGGGCA ACTCGCTTGC GCGGGCAACT CGCTTACCGA TTACGTTAGG GCTGATATTT ACGT- 
GAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA ACCCCGGAAG TCAACAGCAT CCAAGCCCAA 

46 GTCCTTCACG GAGAAACCCC AGCGTCCACA TCACGAGCGA AGGACCACCT CTAGGCATCG GACG- 
CACCAT CCAATTAGAA GCAGCAAAGC GAAACAGCCC AAGAAAAAGG TCGGCCCGTC GGCCTTTTCT 
GCAACGCTGA TCACGGGCAG CGATCCAACC AACACCCTCC AGAGTGACTA GGGGCGGAAA TTTAAAG- 
GGA TTAATTTCCA CTCAACCACA AATCACAGTC GTCCCCGGTA TTGTCCTGCA GAATGCAATT 
TAAACTCTTC TGCGAATCGC TTGGATTCCC CGCCCCTAGT CGTAGAGCTT AAAGTATGTC 

so CCTTGTCGAT GCGATGTATC ACAACATATA AATACTAGCA AGGGATGCCA TGCTTGGAGG ATAG- 
CAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG AACAATAAAC CCCACAG representing the se- 
quence from nucleotide -1 144 to -10 in Rg. 1 . 

According to a further embodiment the promoter and upstream activating sequences have the followmo 
sequence or a functionally equivalent nucleotide sequence: * 

ss AGATCTGCCC TTATAAATCT CCTAGTCTGA TCGTCGACGC ATTCCQAATA CGAGGCCTGA TTAAT- 
?~ITfL CATACGCCTC CGGGTAGTAG ACCGAGCAGC CGAGCCAGTT CAGCGCCTAA AACCCCTTAT 
^^TL^iL AGTTAAAGAA GTTAGAATCT ACGCTTAAAA AGCTACTTAA AAATCGATCT 
CGCAGTCCCG ATTCGCCTAT CAAAACCAGT TTAAATCAAC TGATTAAAGG TGCCGAACGA 



25 



30 



4 



# 



36 



40 



0 238 023 



GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC AATATCAGGC CGCGCACGAA AGGCAAC- 
GCGCTCTACT AAACAGATTA CTTTTGAAAA AGGCACATCA GTATTTAAAG 
CCCGAATCCT TATTAAGCGC CGAAATCAGG CAGATAAAGC CATACAGGCA GATAGACCTC TACCTAT- 
TAA ATCGGCTTCT AGGCGCGCTC CATCTAAATG TTCTGGCTGT GGTGTACAGG GGCATAAAAT TACG- 

5 CACTAC CCGAATCGAT AGAACTACTC ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT 
TTTAAATTTT TATATGGCGG GTGGTGGGCA ACTCGCTTGC GCGCGCAACT CGCTTACCGA TTACGT- 
TAGG GCTGATATTT ACGTGAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA ACCCCGGAAG 
TCAACAGCAT CCAAGCCCAA GTCCTTCACG GAGAAACCCC AGCGTCCACA TCACGAGCGA AGGAC- 
CACCT CTAGGCATCG GACGCACCAT CCAATTAGAA GCAGCAAAGC GAAACAGCCC AAGAAAAAGG 

70 TCGGCCCGTC GGCCTTTTCT GCAACGCTGA TCACGGGCAG CGATCCAACC AACACCCTCC AGAGT- 
GACTA GGGGCGGAAA TTTAAAGGGA TTAATTTCCA CTCAACCACA AATCACAGTC GTCCCCGGTA 
TTGTCCTGCA GAATGCAATT TAAACTCTTC TGCGAATCGC TTGGATTCCC CGCCCCTAGT 
CGTAGAGCTT AAAGTATGTC CCTTGTCGAT GCGATGTATC ACAACATATA AATACTAGCA AGGGATG- 
CCA TGCTTGGAGG ATAGCAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG AACAATAAAC 

75 CCCACAGAAG GCATTT representing the sequence from nucleotide -1176 to -1 in Fig. 1. 

According to a further aspect of the present invention the latter sequence may be preceded by the 1 05 
kb unsequenced upstream region from pTAKA 17 (position 0 to 1.05 in Rg. 2). 

The terminators and polyadenylation sequences may be derived from the same sources as the 
promoters. Enhancer sequences may also be inserted into the construction. 

20 The expressed product may be accumulated within the cells requiring disruption of the ceils to isolate 
the product To avoid this further process step and also to minimize the amount of possible degradation of 
the expressed product within the cells it is preferred that the product is secreted from the cells. For this 
purpose the gene for the desired product is provided with a preregion ensuring effective direction of the 
expressed product into the secretory pathway of the cell. This preregion which might be a naturally 

26 occuring signal or leader peptide or functional parts thereof or a synthetic sequence providing secretion is 
generally cleaved from the desired product during secretion leaving the mature product ready for isolation 
from the culture broth. 

The preregion may be derived from genes for secreted proteins from any source of organism. 
According to the present invention the preregion may be derived from a glucoamylase or an amylase 
30 gene from an Aspergillus species, an amylase gene from a Bacillus species, a lipase or proteinase gene 
from Rhizomucor miehei , the gene for the a-factor from S. cerevisiae or the calf prochymosin gene. More 
preferably the preregion is is derived from the gene fo7 A. oryzae TAKA amylase, A . niger neutral o- 
amylase, A. nigeracid-stable o-amylase, B. licheniformis o^amylase. the maltogenic amylase from Bacillus 
NCIB 11837, B. stearothermophilus a-amylase or B. licheniformis subtiiisin. An effective signal sequence is 
the A. oryzae TAKA-amylase signal, the Rhizomucor miehei aspartic proteinase signal and the Rhizomucor 

miehei lipase signal. 

The TAKA-amylase signal has the following sequence 
ATGATGGTCGCGTGGTGGTCTCTATTTCTGTACGGCCTTCAGGTCGCGGCACCT 
MetMetValAlaTrpTrpSerLeuPheLeuTyrGlyLeuGlnValAlaAlaPro 



GO I I I GGCT 
AlaLeuAla 

The gene for the desired product functionally linked to promoter and terminator sequences may be 
incorporated in a vector containing the selection marker or may be placed on a separate vector or plasmid 

45 capable^ of being integrated into the genome of the host strain. As used herein the expression "vector 
system" includes a single vector or plasmid or two or more vectors or plasmids which together contain the 
total DNA-information to be integrated into the host genome. Vectors or plasmids may be linear or closed 
circular molecules. According to a preferred embodiment of the present invention A. oryzae is cotransfdr- 
med with two vectors, one including the selection marker and the other comprising the remaining foreign 

so DNA to be introduced in the host strain, including promoter, the gene for the desired product and 
transcription terminator and polyadenylation sequences. 

Normally the A. oryzae transformarrts are stable and can be cultured in the absence of a selection 
pressure. If the transformants turn out to be unstable the selection marker may be used for selection during 
cultivation. The transformed cells are then cultured under a selection pressure corresponding to the marker 

55 in question. 
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or nmin ^r?n a P * * for production of hf 9 h yieWs <* man * polypeptide 

Z KZf^^ 22 ^- ~ 22222 has ^ years been used in commercial scale for the preSon 
If fnr ^^!!^" 3 '"^ , enZyme " nd Proteolytic enzymes and accordingly fermentation technoV 

e m '««^fn'sn. is well developed and the microorganism is approved for use in tnTfood 

TnTte I? e n PreS8nt nV6nti0n ^ *» * "sfngTSyse in thlindustria, proSction^f S 

amounts of ,n pnnaple any polypeptide or protein product Bcamptes of such products are chymosin or 
prochymosm and other rennets, proteases, amyfoglucosidases. acid stable amylases from^Se^SuT 
fungal ..pases or prokaryotic lipases, and thermostable bacterial and fungal amylase^ ^SDergjIIus. 
The present invention is illustrated by means of the production of prochymosin Rhizomucormiehei 

io aspartc proteinase. TAKA-amylase and a lipase from Rhizornucor m.ehei Hnizomucorm.enei 

. ^KSLtZST" were obtalned tan CDNA Dbraries or 9enomlc ,ibraries 38 ***** in 

75 EXAMPLES 

p285^'(ATCC '^ZOqS)^" 9 materials in * e following examples are as follows: 

pCAMG91: Boel et al. EMBO Journal 3 (1984). 1581-1585 
20 plCI9R: Marsh et al. Gene 32 (1984). 481-485 

fSS^SSSt 68,10 - (1983> ' 109 " 117 JOh " & P6berdy ' En2yme Microb - Techno'- 
p3SR2: J.M. Kelly and M.J. Hynes, EMBO Journal 4 (1985),! 475-479 
pBR322: Bolivar F. et aJL, Gene 2 (1977). 95-113. ~ 
zs pBR327: Covarrubias l_ et al., Gene 13 (1981). 25-35 

cSSS? *" PUC19: Vieira - " ' ^ ^ (1982) ' 259 - 258 ■* M—no. Math, in Enzymofogy IM - 
The strains used are as follows: 
A. nigen ATCC 1015. ATCC 10582 

12882^ ATCC 204231 4177 ' ATCC 1011 ' ATCC 9576 - ATCC 14488-11491. ATCC 11601 and ATCC 

| . cofc MC1000 (Casabadan. MJ. and Cohen. S.N.. J.Mol.Biol. 138 . 179-207) (NCIB 11956) 
Rhizornucor miehei : CBS 370.65 ; 
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Example 1 

Preparation of plasmid 285'proC containing the prochymosin gene 

of am3% e Zl^ m SL Q %Z™ '^Ti a «" Stomacn cDNA and inserted into the Psti site 

S 3 i2Tr al-.Biochemistry 18 (1979). 5294 and Tuelsen et al.. Nucteic Acids 
n^J f 06 } t °° btain j? 26 ' pUC9 was cut with Sail and filled cut with Klenow polymerase and 

T4 fl , ' 9a f e - 71,6 resultin 9 D,asmid was cut with BamHI-EcoRI and the 2.7 Kb large fragment was 

S£ ^SL°2Ir£ fragment from P^ 6 «—*•» the N-termina. end oX^ooCo^ 

gene to create PUC9'. P UC9- contains a Hindlll site N-terminally of the prochymosin gene. pUC13 was cut 

^ m :^'r an ^ ar, 'ST- aJ - ^ reSpective smaH fragments P wereTgated with aoS fcTSS 
DUCi?^n2n? '^."f 01 ^ 0 "^'" 31 6nd * Prochymosin gene to obtain plasmid P UC13'. 
omSI ^ LT Xba '" Site C-terminally of the prochymosin gene. A0.65 kb Xmal-Xbal fragment of 

SSfo ^L! 9 ^ ^oL 0 " 46 ^ Hind,l, - XmaI frag™* of PUC9- and a 11 kb Xbal-Hindlll SgmTnt of 
P285 to create plasmid p285' proC containing the prochymosin gene as illustrated in fig. 3. 
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Example 2 

Cloning of the A. oryzae TAKA-amylase A gene 
5 Isolaltion of a partial cDNA clone 

From A. oryzae Hw 325 grown on potato starch, mRNA was prepared according to Kaplan et al., 
Biochem. J. 183, (1979) 181 -84. A partial cDNA clone containing 1050 bp of the TAKA-amylase gene was 
obtained by specific priming of mRNA with a 14-mer oligonucleotide mixture: 



10 
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5 'GG A TT A TC A TG A TT 3'(NOR-168) 
G G G G 



complementary to the coding sequence for amino acids 295 -299 in TAKA-amylase (Toda et al., Proc. 
Japan Acad. 58, Ser. B, (1982) 208 -12). Cloning procedure was according to Gubler & Hoffmann, Gene 25, 
(1983) 263 -69. Sequencing at the ends and in the middle of the cDNA done demonstrated presence of 
sequences corresponding to the amino acid sequence of TAKA-amylase. 



Isolation of genomic clones 

Mycelium from A. oryzae Hw 325 was harvested and processed for preparation of DNA according to the 
method used for A. niger described by Boel et al. supra. Restriction fragments of 3 -10 kb, generated by 
partial digestion with Sau3A, were ligated with BamHI digested, dephosphorylated pBR322 (New England 
Biolabs). 50.000 recombinants were screened with oligonucleotide probe NOR-168 (see above), and 7 were 
found to contain DNA coding for TAKA-amylase. One clone was chosen for further use of the promoter 
region, having 2.1 kb upstream of the mRNA-start. Restriction map for plasmid pTAKA 17 is outlined in Fig. 
2. pTAKA 17 transferred into an EL coH strain was deposited with Deutsche Sammlung von Mikroorganismen 
(DSM), Griesebachstrasse 8, D-3400, GSttingen, on February 23, 1987 and accorded the reference number 
DSM 4012 DSM being an international depository authorized under the Budapest Treaty of 1977, affords 
permanence of the deposit and accessibility thereto by the public in accordance with rules 9 and 11, 
respectively, of the above treaty. 



Example 3 

Construction of a Rhizomucor miehei cDNA library 

40 Th© phycomycete fungus Rhizomucor miehei (for a morphological and taxonomical description of this 
species see: Schipper, MAA. On the genera Rhizomucor and Parasttella. In: Studies in mycology, Institute 
of the Royal Netherlands Academy of Sciences and Letters. No. 17 (1978), 53-71) secretes an acid 
proteinase (Rhizomucor miehei proteinase, in the following abreviated to RMP) which is widely used for 
clotting of milk in cheese production. In order to obtain cDNA recombinant clones of this protein inE. coji, 

^ total RNA was extracted from homogenized R. miehei mycelium as described by Boel et al. (EMBO J., 3: 
1097-1102, 1984) and Chirgwin et ah. (Biochemistry (Wash.), 18, 5294-5299, 1979). Poly(A)-containing RNA 
"was obtained by two cycles of affinity chromatography on oligo(dT)-cellulose as described by Aviv and 
Leder (PNAS, USA, 69, 1408-1412, 1972). Oligo(dT) primed complementary DNA was synthesized and 
made doubiestranded according to Gubler and Hoffman (Gene, 25, 263-269, 1983). Doublestranded cDNA 

50 was tailed with dCTP and terminal deoxynucleotidyl transferase as described by Roychoudhury et al. - 
(Nucleic Acids Res, 3, 101-106, 1976). The plasmid pBR327 was linearized with Pstl and tailed with dGTP. 
The oligo(dC) tailed dscDNA was annealed to this oligo(dG) tailed vector as described by Peacock et al. - 
(Biochim.Biophys.Acta, 655, 243-250, 1981) and used to transform a hsdR-, M* derivative of E. coli 
MC1000 (Casadaban and Cohen, J. Mol.BioL, 138, 179-207, 1980) to generate recombinant clones. 
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Identification of RMP specific cDNA recombinants 

A mixture of 16 heptadecamer oligodeoxyribonucleotides 

d ( GC A TCCCA A AA A TA A TA ) , 
G G G G 

oneof which is complementary to RMP mRNA in the region coding for Tyr-Tyr-Phe-Trp-Asp-Ala (Bech and 
to F0 ^f n ' N « J- 35: 275-280. 1981) was synthesized on an Applied ££££ (nTo^ 

ZT^IT P" ffi ^y polyacrylamide gel electrophoresis. Approximately 10,000 E. coli recombinants 

^ ^TSSSI!! 1 Jf ^^V^ transfetred to 540 paper fitters! The" colonies were 

lysed and immobilized as described by Gergen et al. (Nucleic Acids Res. 7. 2115-2135. 1979). The filters 

rjJ^"? 6 !^* " p - tebelted RMP-specffic heptadecamer-mixture-as described by Boel et aT 

is J " i. 1097-1102. 1984). Hybridization and washing of the filters were done at 40-C. followed by 

autoradiography for 24 hours with an intensifier screen. Miniprep plasmid DNA was isolated from a 
?S U y l Pf ^! P1016, by Stendard iwocedures (Birnboim and Dofy. Nucleic Acids Res.. 7. 1513- 

J^T',™ 1 and DNA sequence of the cDNA insert was established by the procedure of Maxam and 
Gilbert (Methods ; Enzymoi. 65. 499-560. 1980). pRMP1016 was shown to contain partof the 5' unSSaS 

M end of the mRNA and then extending through regions encoding a 69 amino acid-long preproregion and 300 
amino acids into the mature part of the RMP protein. Since pRMP1016 did not contain any insert 
corresponding to the complete 3* end of the RMP mRNA. the cDNA library was rescreened with a »P nick- 
translated 3* specific restriction fragment from clone pRMP1016. whereby clone pRMP2931 was isolated 
This clone contains part of the 3- untranslated region and an open reading frame with the 270 triplets 

25 encoding *e carboxyterminal part of the RMP protein. pRMP1016 and pRMP2931. therefore have extensive 
°™? P '. and COmbined sequence of the two clones gives the sequence of the R. miehei preproRMP 
cDNA A total of 1416 nucleotides was sequenced between the G:C tails resulting fromlhe~c~DNA cloning 
procedure. The established DNA sequence is shown in Fig. 4a and b together with the deduced amino acid 
sequence of a precursor to RMP. In Fig. 4a and 4b the horizontal line indicates the position of a synthetic 

30 °" go mbcture "sed for cDNA library screening. An arrow shows the position where processing occurs in 
maturation of native RMP. Nucleotides are numbered from the first base in the initiating Met-codon and 
amino acids are numbered from the first residue in the mature RMP. From this cDNA sequence it can be 
concluded that RMP is synthesized as a 430 amino acids long precursor with a propeptid of 69 amino 
acids. A putative signal peptidase processing site (von Heijne. EurJ.Biochem. 133. 17-21. 1983) in this 

35 precursor 001116 1)0 between Ala(-48) and Arg(-47), and the mature RMP will be generated by autoproteolytic 
cleavage between Glu-1 and Ala(+1). The cDNA deduced amino acid sequence of RMP is in good 
agreement with the previously published partial amino acid sequence (Bech and Fottmann.) Neth-Milk Dairv 
J. 35: 275-280, 1981). * 

To facilitate further construction work with the RMP cDNA's, a Hindlli linker was inserted at a Banl site 

4a just 3 ' to ^ TAA-termination codon identified in clone pRMP2931 as follows: 25ug pRMP2931 was 
digested with Pstl to obtain the RMPcDNA insert This insert was purified by 1% agarose gel elec- 
trophoresis, electrocuted from the gel. purified by phenol and cloroform extractions and precipitated with 
NaCI and ethanoi. This fragment that encodes the 3' half of the RMP. was digested with Banl and the Banl 
cohesive restnction site ends were filled in with a mixture of the four dNTP's and the Klenow fragment of E. 

45 DNA Polymerase. To these filled-in ends were added Hindlli linkers in a T4-DNA ligase reaction The 
ligation reaction mixture was extracted with phenol and chloroform and the DNA was precipitated with 4M 
NH4* acetate/ethanol. The purified DNA was digested with an excess of Hindlli enzyme, and a 380 bp 
fragment was purified on a 6% polyacrylamide gel. This fragment that contains the 3' end of the RMP open 
reading frame plus the TAA termination codon was ligated to a Hindlli digested and alkaline phosphatase 

so Seated plCl9R The ligation mixture was used to transform competent E. coH cells, and transformants were 
selected on ampiclllin containing agar plates. Plasmid DNA was purified from transformants and correct 
recombinants were identified by restriction endonuclease digestions and agarose gel electrophoresis. From 
one such correct recombinant, pRMP3*. a 210 bp Bglll/Hindlll fragment was isolated by 6% polyacrylamide 
gel electrophoresis. This fragment contains the 3'end of RMP cDNA from the Bglll site at amino acids 297- 

55 299 and extending through the TAA-termination codon to the inserted Hindlli linker. 

The 5* part of the RMP cDNA was isolated from pRMP 1016 as a 997 bp Hindlll/Bglll fragment bv 1% 
agarose gel electrophoresis. The Hindlli site is located in the RMP-DNA at a position corresponding to the 
[r?m e w "^: ~? "I * 6 prOSegment - ™ s 997 °P 5 ' fragment was ligated to the 210 bp 3" fragment in a 
Hindlli digested and phosphatase treated P IC19R. With this ligation mixture, recombinants were obtained 
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from E. coll and a correct plasmld. pRMP, with the 5' part of RMP joined to the 3' part was Identified by 
restriction exzyme analysis. The construction of pRMP is illustrated In Rg. 5. pRMP does not encode the 
RMP preregion and the 5' half of the prosegment 



Example 4 

Construction of an Aspergillus expression vector designed to obtain secretion of active RMP 

In this example a plasmld was constructed designed to express RMP under control of the giucoamylase 
promoter, signal and terminator sequences. The giucoamylase promoter and terminator sequences were 
derived from the giucoamylase genomic gene clones in vector pCAMG91. The construction of pCAMG91 is 
described by Boel et al. (EMBO Journal 3 (1984), 1581-1585) and an endonuclease restriction map of 
plasmid pCAMG91 is shown in Rg. 6. 

pCAMG91 was digested with Sail and Pstl restriction endonucleases. From such a digest a 698 bp 
fragment was isolated on an agarose gel. This Sall-Pstl fragment contains the region encoding the 140 bp 3- 
untranslated part of the giucoamylase mRNA plus 540 bp 3' to the poly(A)-addition site. This 3fragment was 
treated with T4-DNA polymerase to "blunt end" the restriction sites before the addition of Xbal linkers and 
digestion with Xbal restriction enzyme. This 3' end of the giucoamylase gene was ligated to pUC13 
linearized with Xbal to create plasmid pAMG/Term containing the giucoamylase gene poly(A) addition 
region. The construction of pAMG/Term is illustrated in Rg. 7a. 

The 3'end of the A. niger giucoamylase gene was obtained as a 700 bp Xbal fragment from 
pAMG/Term. This terminator fragment was ligated to Xbal digested and phosphatase treated pJC19R. With 
this ligation mixture recombinants were obtained obtained from E. coli and a correct plasmid. 
pICAMG/Term, with the 5'end of the terminator fragment facing the Hindlll site of the multiple cloning site of 
the PIC19R vector was identified by restriction enzyme analysis. The construction of pICAMG/Term is 
illustrated in Rg. 7a. From pICAMG/Term the giucoamylase terminator (AMG terminator) region was isolated 
as a 750 bp Hindlll/Clal restriction fragment by 1% agarose gel electrophoresis. From pCAMG91 the 
giucoamylase promoter (AMG promoter) was isolated together with regions encoding the giucoamylase 
signal peptide, hexapeptide-prosegment and the pBR322 ampicillin resistence gene (Amp) as a 3.5 kb 
Clal/BssHII fragment by 1% agarose gel electrophoresis. A synthetic BssHII/Hindlll linker was prepared from 
two synthetic 31'mer oligonucleotides synthesized on an Applied Biosystems Inc. DNA-synthesizer. The 
synthetic linker has the following structure: 



RVSKQSESKD 
CGCGTAAGTAAGCAGAGCGAGAGCAAGGATA 

ATTCATTCGTCTCGCTCTCGTTCCTATTCGA 

This linker was used in a ligation reaction with the 3.5 kb giucoamylase promoter containing fragment 
and the 750 bp giucoamylase terminator containing fragment. The ligation mixture was used to transform E. 
coli and a correct recombinant, p673. was identified by restriction endonuclease digestion. The isolated 
p673 is a Hindlll cloning vector, into which an appropriate Hindlll cDNA fragment can be inserted between 
the giucoamylase hexapeptide prosegment and the giucoamylase transcription terminator region. The 
inserted cDNA will be under transcriptional control by the giucoamylase promoter, and secretion of the 
translated fusion product will be directed by the giucoamylase signal peptide plus the giucoamylase 
hexapeptide prosegment p673 was digested with Hindlll, treated with alkaline phosphatase and ligated with 
a 1.2 kb Hindlll fragment purified from a digest of pRMP. 

The ligation mixture was used to transform E. coli and a recombinant, p686, with the RMP cDNA 
inserted in the correct orientation to achieve RMP expression was isolated and characterized by restriction 
endonuclease digestions. p686 encodes a RMP precursor with the following structure: giucoamylase signal 
peptide, glusoamylase hexapropeptide, amino acids -45 to -1 of the propeptide from RMP, 381 amino acids 
of mature RMP. The construction of p686 is illustrated in Rg. 7b. 
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Example 5 



mJZJi ^!? ra J!l!? 0iii T ent . 0f present inv «rtJon the open reading frame of preproRMP should be 
Z*% ^f^S? 9 _^° m ^ To *> Ws- a BamHI restriction endonuclease site was insertS 

^sS^.^^h^r SeqU6nCe W3S Synth6Sized - **> oiigonucteobdes on an^pSd 

M L F S 
GATCCACCATGCTGTTCTC oligo 697/698 
GTGGTACGACAAGGAAGT 

3- «^ iS r^ 9 ^ nt r haS a J^' C ° heSiVe 8nd 5 ' to *« bating Met-codon and a Ddel cohesive end in the 
3 end. The two oligonucleotides were kinased with ATP and T4 polynucleotide WnaJ~™SjT2, ^2 

ToZtToZ^ 9 ^^ ? bP DdeI/Hind,l, RMP ^9-nt puriTeTf^^P^^n^H^S 
wanfELn£2 Rgati0n mixture was used to transform E. coli cells, and cwect^rnVrS 
were identified by restncbon enzyme digestions on miniprep purifieTplasmids Correct recomb^S 
ESS? W8r ! ; ^!. nC8d to ^ *• ^"«nce of the oligonucleotides useTo7e mS^SSZ 
25 c^nn RMP ^ BamH ' ^ Hind " 1 - ^ a 110 bp BamHi/Hindm fragment wTth?Sn?S 

^^LnZJ 9 lL^^ and part of the RMP prosegment was purified by 10% poly aery lam S 

fragnient Z! S etectroehrt8d - P""™* and CHCI, extracted and ethar^SE The 
SSr din^rL^ ^""9 frame and the AMG terminator sequences were obtained fm^lasmW pS 
ater digestion wrth EcoRI and partial Hindlll digestion. Hereby a 1.9 kb fragment was released andlS 
OT fragment was punfied by agarose gel electrophoresis, electrocution, phenol and CHCI, eSrac7on before 

ttSSTSS-ZX t^SZT ^ to ** 110 bP ^"^ndlil^en^ p^ 



Example 6 



40 ^ZT^Z^Z^r^ 6 f reSSi ° n vector ^ned to obtain secretion of active RMP in A. oryzae by 
means of the Aspergillus niger glucoamylase promoter ~ y oy 

Bssf S 8 riS^ y,a ? P T 0ter W3S iSOlat8d 88 fo,Iow5 - 25 ug of pCAMG91 was digested with EcoRl and 
BssHII restncbon endonucleases. After this double digestion a 270 bp DMA fragment could be isolated^ 

aoar^ 2 1 f!^ t glucoamylase gene (AMG gene). After electroelution of the DMA froTthe 
So^^^^T Z** PUrifled by pheno1 «■ «** extractions before ethanol predprtatiTlhe 
KSJS Wa ! J!" ? i9ested SfaNI - ™» enzyme has a cleavage site just V to ttTinrtiating 
JSSSSJ^ » 0^ *e glucoamylase gene. After complete digestion, the DMA was treated wSHZ 

60 S rS^T ( h!I! OVV 2 t °T P°'y mera se I and all four dNTP's to generate blunt ends on the dZ.To 
nSnoZ !f added Bglll linkers with DNA ligase. and the DNA was digested with an excels OBgrn 
restncbon enzyme. After separation of the DNA fragments on a 10% polyacrylamide qel a 175 bo S 
fragment could be isolated by electroelution. This fragment has a SgTST liJSL m a oosSon 
ST^'Kf ^N" potion site lust 5' to the bating methionSe cZZ tETSL of DnTwS 

55 fraSrm" e^SST? ^™ P h °*P"atase treated p«C19R vector, and the ligation m£lS was ^ 
transform E. cohcells. Among the resulting transformants correct plasmids were identified bv^**EL™ 

STo^SToTb^ *"* «*»* P>asmi P d^S.i W wi "JSS 

Bglll to liberate a 0.16 kb fragment which contained the 5' untranslated region of the glucoamyiasT gene 
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together with approximately 100 bp of the 3' part of the promoter region. This fragment was purified by 
polyacrylamid gel electrophoresis, electroeluted. phenol and CHCI, extracted and ethanol precipitated To 
join this fragment to the remaining part of the glucoamylase promoter region from P CAMG91, the following 
steps were earned out 25 ug P CAMG91 was digested with BssHII, and then further partially digested with 

s Ndel. After filling in the fragment ends with all four dNTP's and the Wenow fragment of DNA polymerase a 
1.4 kb DNAfragment was isolated on a 1% agarose gel. This fragment contained all of the promoter region 
together with the 5' untranslated region and the signal peptide encoding region. The fragment was 
electroeluted, phenol and CHCI, extracted and ethanol precipitated to concentrate the DNA After digestion 
with! Nsil, the DNA was run on a 1% agarose gel, and a 1.2 kb Ndel -Nsil fragment was isolated by 

10 electrdeluhon. This DNA had been given a blunt end at the Ndel site in a previous reaction and it was now 
ligated to the 0.16 kb Nsil -Bglll fragment from pB401.1 in a Nrul -Bglll digested plC19R vector. The ligation 
m,xtur ® V ^ usedto transform I- £oli cells, and among the resulting transforments correct recombinants 
W o?L o V restriction enzyme digestions of miniprep plasmids. One such correct recombinant, 

PB408.3 was digested with Hindlll and Bglll, and the glucoamylase (AMQ) promoter was isolated as a 1 4kb 

re fragment of a 1% agarose gel. The fragment was electroeluted, phenol and CHCI, extracted and ethanol 
precipitated. This glucoamylase promoter fragment was then ligated to a 2.0 BamHl-EcoRI fragment from 
pRMPAMGTerm (see example 5) in a Hindlll-EcoRI digested pUC19 vector. The ligation mixture was used 
to transform E. coli cells, and among the resulting transformants correct recombinants were identified by 
restriction enzyme digestions of miniprep plasmids. One such correct recombinant p778 was grown in large 

20 scale for isolation of recombinant plasmid and the plasmid preparation was purified by CsCI/Ethidium 
bromide eentrifugation. This plasmid directs the synthesis of RMP under control of the glucoamylase 
promoter and terminator sequences. The construction of p408.3 is illustrated in Fig. 9a and the construction 
of p778 is illustrated in Fig. 9b. ~ 



Example 7 

Construction of Aspergillus expression vector designed to obtain secretion of active RMP by means of the 
Aspergillus oryzae TAKA-amylase promoter 

30 

50 ug of plasmid pTAKA17 (see example 2) which contains the Aspergillus oryzae TAKA-amylase 
genomic gene was digested with Sail. This enzyme has a restriction site in the genomic DNA at a position 
corresponding to amino acid residue 26 of the mature TAKA-amylase. Another Sail restriction site is located 
app. 1300 nucleotides upstream to this position that is in the 5'end of the upstream promoter region. After 

35 Sail digestion this 1300 bp promoter containing fragment was purified by agarose gel electrophoresis and 
the DNA was purified by phenol and CHCI, extractions and ethanol precipitated. The DNA was then 
dissolved in exonuclease III buffer and digested with exonuciease III according to Henikoff, S. (Gene. 28: 
351-359, 1984). The reaction was stopped to obtain approximately 130 bp deletions in each end of the 
DNA. The deletion of app. 130 bp from the Sail site of the coding region of the TAKA-amylase gene in this 

to way gives the opportunity to introduce multiple cloning site linkers just upstream of the initiating methionine 
codon. The exonuclease III treated DNA was digested with SI nuclease according to Henikoff, S. (Gene, 28: 
351-359. 1984) and precipitated with ethanol after phenol and CHCI, extractions. Repair of the SI nuclease 
treated DNA to obtain ligatable blunt ends were done with all four dNTP's and the Wenow fragment of DNA 
polymerase I according to Henikoff, S., (Gene. 28: 351-359. 1984). The DNA was digested with EcoRI which 

« cuts once in the 1300 bp Sail fragment to generate two groups of fragments. One group was about 380 bp 
long and represented upstream regions while the other group was about 620 bp long and contained the 
promoter region. These groups of EcoRI digestion products were separated on an agarose gel, and the app. 
620 bp long DNA fragments were electroeluted and ligated to an EcoRI/Smal digested PUC19 vector. The 
ligation mixture was used to transform competent E. coli cells, and miniprep plasmid DNA was isolated from 

so the recombinants. These deletion mutants were characterized by restriction enzyme digestions to identify 
plasmids with deletion end points just 5' to the initiating methionine codon. A few candidates with the 
desired characteristics were sequenced and a mutant (p9) that had 9 bp deleted 5' to the A in tine ATG- 
methionine codon was chosen for further constructions. p9 was digested with EcoRI and Hindlll. and a 645 
bp TAKA-amylase promoter containing fragment was- isolated by agarose gel electrophoresis, phenol and 

ss CHCI, extracted and precipitated with ethanol. pTAKA17 was digested with Sail and EcoRI and a 510 bp 
fragment containing the TAKA-amylase-promoter upstream regions was isolated by agarose gel elec- 
trophoresis, phenol and CHCI, extracted and precipitated with ethanol. These two promoter regions were 
ligated to each other and to a plCl9R vector, that had been digested with Sail and Hindlll. The ligation 
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mixture was used to transform E. con cells, and correct recombinants were identified by restriction enzyme 
digestion of plasmids extracted as minipreps. in one such recombinant p719 the TAKA-amylase promoter 
region from Aspergillus oryzae is found as a 1.1 kb portable fragment that can be excised by a number of 
various restriction enzyme digests. The construction of p719 is illustrated in Fig. 10. 

s From pRMPAMGTerm the preproRMP open reading frame and. glucoamyiase terminator region - 
(AMGTerm) was isolated as a 2 kb fragment after digestion with BamHI and EcoRl. This fragment was 
purified by agarose gel electrophoresis, phenol and CHCI, extractions and then concentrated by ethanol 
precipitation. The promoter from the TAKA-amylase from A. oryzae was now isolated as a 1.1 kb fragment 
obtained after digestion of p7l9 with Sail and BamHI. This fragment was purified by agarose get 

io electrophoresis, phenol and CHCI* extractions and then ethanol precipitated. The 1.1 kb promoter fragment 
was Hgated to the 2 kb BamHI/EcoRI fragment from pRMPAMGTerm in a pUCl9 vector, that had been 
digested with Sail and EcoRI. The ligation mixture was used to transform E. coii ceils and among the 
resulting transform ants correct recombinants were identified with restriction enzyme digestion of mini prep 
plasmids. One such correct recombinant p777 was grown in large scale for the isolation of recombinant 

t5 plasmid, and the plasmid preparation was purified by CsCI/Ethidium bromide centrifugation. The construc- 
tion of p777 is illustrated in Fig. 11. 



Example 8 

20 

Construction of an Aspergillus expression vector designed to obtain secretion of the Rhizomucor miehei 
lipase under control of the Aspergillus oryzae TAKA-amylase promoter 

Construction and identification of a lipase cDNA clone in E. coir 

25 

in order to obtain information which allows the construction of a specific oligonucleotide probe, a partial 
sequence determination was carried out on the purified Rhizomucor miehei lipase (Moskow'rtz, G.J. et aL, 
J-Agric. Food Chem., 25 (1977), 1146-1150). In the following text the abbreviation RML is used for the 
Rhizomucor miehei fipase.The supernatant from a culture broth of Rhizomucor miehei. from which mycefia 

30 and low molecular weight substances had been removed was subjected to anion exchange chromatog- 
raphy. The main lipolytic fraction from the column was desalted and ultrafiforated prior to lyophilization. The 
lyophlfized powder was then subjected to an affinity chromatography. The pooled lipase fractions from the 
column were desalted and concentrated by ultrafiltration. This concentrate was then subjected to a 
hydrophobic interaction chromatography (HIC) and the lipase from the HlOpurification was used for amino 

35 acid sequence determination. The sequence determination was carried out both on the native enzyme (N- 
terminal seqeunce) and on selected fragments obtained after proteolytic digestion of the lipase with 
Armillaria mellea protease. The sequence determination was performed with a Gas Phase Sequencer - 
(Applied Biosystems Model 470A) as described by Thim, L et al. (FEBS Lett 1987, in press). 

RML was digested with Armiliariamellea protease as described by Moody et al. (FEBS Lett. 172 (1984), 

40 142-148) with the only exception that the enzyme to substrate ration was 1:40 (moirmoi). Fragments 
obtained were separated by HPLC and the UV-absorption was monitored at 280 nm and 214 nm. In order to 
identify suitable fragments for the construction of oligonucleotide probes, only peptides which showed a 
high ratio between 280 nm and 214 nm were sequenced as these fragments contain Trp and/or Tyr. 
The following N-terminal sequence was found by use of the native RML: 

^ 5 10 15 

Ser-Ile-Asp-Gly-Gly-Ile-Arg-Ala-Ala-Thr-Ser-Gln-Glu-Ile-Asn- 

20 25 

50 Glu-Leu-Thr-Tyr-Tyr-Thr-X-Leu- ( Ser ) - (Ala ) - . 

One of the fragments isolated from the proteolytic digest had the sequence of: Arg-Thr-VaHle-Pro-Gly- 
Ala-Thr-Trp-Asp-X-Ile-His, and this fragment was used for the synthesis of a specific oligonucleotide probe. 

The Rhizomucormiehei cDNA library from example 3 constructed for isolation of the aspartic proteinase 
55 (RMP) recombinants from this organism was also used for identification of lipase specific recombinants. A 
mixture of oligonucleotides was synthesized on an Applied Biosystems Inc. DNA-synthesizer. The mixture 
which has the structure: 
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5 ■ A TCCCANGTNGCNCC 3' 430/431 
G 



10 



15 



20 



26 



30 



36 



40 



46 



50 



was complementary to RML mRNA in a region encoding the amino acids G ly- Ala-Th r-Trp- Asp. This 
pentapeptide was identified as a segment of an amino acid sequence obtained from proteolytic fragments 
of the purified RML protein (see above). 

The Rhizomucormiehei cDNA library was screened with the 32 p-kinased lipase oligonucleotide mixture 
as described for screening with the RMP specific mixture. Hybridization and initial washing of the filters 
were done at 43°C. After autoradiography, the filters were washed at 47°C. Colonies that showed strong 
hybridization were isolated and the inserted cDNAs in the corresponding plasmlds were sequenced to 
identify RML specific recombinants. Two such recombinants p353.7 and p353.16 had inserts of about 1.2 
kb. The DNA sequence obtained from these two recombinants starts In the middle of the signal peptide - 
(see Fig. 12) and extends through to the poly A tall. In this region one long open reading frame could be 
identified. Since the two recombinants did not include sequence for the 5' part of a signal peptide with Its 
initiating methionine codon a synthetic oligonucleotide (584) 
5' CGAGAGGGGATGAG6GGTGG 3* 584 

was synthesized. This oligonucleotide 584 is complementary to the RML mRNA in a region encoding the 
amino acid sequence Pro-Pro- Leu-IIe-Pro-Ser-Arg found in the propeptide region (see fig. 12). After the 
oligo 584 had been kinased to high specific activity with T4 polynucleotide kinase and S2 p- r ATP, it was 
used in a primer extension reaction on Rhizomucor miehei mRNA with AMV reverse transcriptase according 
to published procedures (Boel, E. et al.. PNAS, USA, 80. 2866-2869, 1983). The primer extension reaction 
products were electrophoresed on a 10% polyacrylamide/urea gel and two cDNA products were resolved. 
These two cDNAs, one 150 nucleotides long and the other 160 nucleotides long were both electroeluted 
and sequenced by the chemical degradation procedure for DNA sequencing. Both cDNAs gave readable 
sequence extending from the primer region and up to a position 9 nucleotides 5' to an initiating methionine 
codon. The sequence confirmed the sequence obtained from the lipase recombinant cDNA plasmids. The 
lengths of the two primer extensions cDNA products indicate that the 5'end (CAP-site) of the lipase mRNA 
will be located app. 5 or 15 nucleotides 5' to the first A nucleotide shown In Fig. 12. A microheterogeniety in 
the location of the 5'end of mRNA's from fungi is very common. By combining the sequence obtained from 
the two cloned cDNAs p353.7 and p353.16 with the sequence from the primer extension analysis, the amino 
acid sequence of a RML precursor can be established. The DNA-sequence and the corresponding amino 
acid sequence of the RML precursor is shown In Fig. 12. In Fig. 12 the horizontal lines indicate the 
positions of synthetic oiigos used for cDNA synthesis and for cDNA library screening. An arrow shows the 
position where processing occurs in maturation of native RML Nucleotides are numbered from the first 
base in the initiating Met-codon and amino acids are numbered from the first residue in the mature native 
RML The RML is encoded by an open reading frame extending from an initiating Met codon and then 
through 363 codons before a stop codon is reached. In this precursor the first 24 amino acid residues would 
constitute a typical hydrophobic signal peptide. According to the productive rules of von Heijne 
(EurJ.Biochem. 133, 17-21, 1983), the signal peptide would be cleaved from the following propeptide by a 
signal peptidase cleavage between the Ala-and Val residues at position -71 and -70, respectively. 

Since the N-terminal amino acid sequence analysis of purified RML obtained from the culture 
supernatant from Rhizomucor miehei identified Ser-lle-Asp-GIy-Gly-lIe-Arg as the N-terminus of the active 
RML enzyme, the propeptide of the RML precursor consisted of the next 70 amino acid residues in the 
precursor. Beginning with this N-terminal Ser residue, the mature RML extends through 269 residues before 
reaching a termination codon. In this mature 29500 dalton enzyme a lipase substrate binding site is located 
around residue Ser(144) which is conserved in a number of lipases. In the 3' end of the RML mRNA 104 
nucleotides were localized as an untranslated region between the TAA stop codon and the poly(A) tail. 23 
nucleotides 5' to this poiy(A) tail a repetitive structure consisting of 7 AT basepairs was found while no 
typical eukaryotic polyadenyiation signal could be identified. 

In a preferred embodiment of the present invention a number of changes was carried out on the RML 
cDNA. These changes involved removal of the G:C tails added to the cDNA during cloning and addition of 
restriction endonuclease sites 5' and 3' to the open reading frame. A number of convenient restriction sites 
were also introduced in the signal peptide and propeptide regions of the cDNA. 

P353.16 was digested with FnuDll and a 880 bp DNA fragment (the 3'end of RML cDNA) was isolated 
by agarose gel electrophoresis. The fragment was electroeluted, phenol and CHCI 3 extracted and precipitat- 
ed with ethanol. 
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im ™ S jT ,^- u CD ^ was then ,iC;ated to a PUC19 vector that had been digested with SmaJ and 
***** T*^^*!***' ^ li9afi0n reacfion to transform competent EL cSi cTlte 2 

mTnin^rn^^. r 13 "* 3 C0TOCt ^binants were identified by restricflSTeW^gesto Trt 
^S kb fl^L? 6 f PProPriate recombinant P435-2 was digested with Ban II Hindm^d a 
CHC^S^J^V^^.T^ 961 etectra P h °^s. The fragment was electroeluted. phenoTand 
CH% extracted and preoprtated with ethanol. This fragment of RML cDNA had a major part of the pUC19 
multiple cloning site Joined to its & untranslated region. ^ puuia 

The 5' end of the RML cDNA was redesigned using synthetic oligonucleotides in order to introduce 
^^^."^ DNA-^quence of the sy%h^fragmenV(RMLto fe snownln HgT^S 
^ni?^H i° inin 9 sites of the indrvidually synthesized oi^^s 

^ Jl ^ ViZ " vertcaWl °ri«>maJ lines. The resulting fragment (RML 57 waspunWaH 

fragment from P435.2 In a BamHI/Hindlll digested P UC13 vector. The ligation reaction waTuseTto 

SSb^reSSIn 1 . — T ^ a™ 9 *° ""0 t-nsformants 9 correcTr^rnSnlnte^eS 
S^EJX ^stocfaon enzyme digestions on miniprep plasmids. In one such correct recombinant pB544 

rFr^ Fr^ ^Zr*** * 2? "V* eXPeCted * UCture - ^ construction of pBs£ iTtaES 
A PB544 *? Prepro cDNA was isolated as a 15 Kb BamHI fragment by agaresVoel 
J!Tfl°^ S t - *" l^T 350 " V8Ct0r ^ on 106 P rom ^r from the Aspergillus oryzae TA^WlaS 
itZZ ^ t T inat0r , fr0m 1,18 ^R" 1 " 5 Hiaer glucoamylase gene was prepare d^llowTp^^^ 
8X ^P te J> Rested with Sail and BamHI. The resulting 1.1 kb TAKA-amylase promoWfn^ment wS 

E£f t£ rSnt r?^ ^T Ph0reS ! S - P,CAMG "" e ™ <-e example 4, JTSETStfEK 25 
EcoRL The esutongOJS kb glucoamylase terminator fragment was purified by agarose gel electroohoreSs 

2ra£T? ^ C ^ eXtrart0nS 01858 ^ frasments were emano. pn^prta^ and iSiT a 
^^Rldigeeted P UC19 vector. The ligation reaction was used to transform? cofi cette andWng th^ 
S transforms* correct recombinants were identified by restriction enzVmTdigestlS S 
P'^' d t c 0n8su f o recombinant P 775 was digested with BamTTand treated wS, aS 

Sr^n^Trn^ ^ from PB544 was ligated is pT^ vSTr 

and transformed intoE. cojj. A recombinant p787 with the RML prepro cDNA inserted in the conect 

l^° n ^^ rom0ter tBrminator ^ id8frtified by restS/enzyme dSoWoTmSS 
plasmids extracted from E. con transfoimants. P 787 plasmid DNA was grown interne Sa^dthe^ZSd 
preparation was purified by CsCI/Ethidium bromide centrifugation. The^nsJci^of^TlS^S 



Transfbmiation of Asperoillus oryzae (general procedure) 

i„nJ.2L? °I YPD (Sh8 n an * ^ Mettlods in Y «ast Genetics, Cold Spring Harbor Laboratory, 1981) was 

ToT^^ZT-^ 2 tP^' '? 4177 ° r mutente hereof ^ Abated with 

O s T^nln ^,1^ ^ WaS harvest8d b y fixation through miracloth and washed with 200 ml of 

0.6 M MgSO.. The mycelium was suspended in 15 ml of 15 M MgS0 4 , 10 mM NaH,PO, pH = 5 8 The 

^ ° n . ^ " d 1 m ' * buffer 120 mg of NovozymTS 'b^toh i££ wat 

« .!!! n - 1 m ' ° f 12 m9/ml BSA < Si 9 m a type H25) was added and incubation witTgen^ 

" 37 ° Q - a ^ number * protopbsts was viab,e r rp " 

f" 5 ^ 05 * 0 " ^ filtered trough miracloth. the filtrate transferred to a sterile tube and overlayered 
°f°-fM sorbitol, 100 mM Tris-HCI, pH = 7.0. Centrifugation was performed to l?Sh?iOOo! 

mM Tris^pT-Ts Sen 9 ^ "XT** 2 VO,Um6S <* STC (1^ M -l^°iS 

!H«m* I *~ 2) Were added to protoplast suspension and the mixture was 

centrrfugated for 5 m,n. at 1000 g. The protoplast pellet was resuspended tasSrt ^C^reolSete? 
Th.s was repeated. Hnally the protoplasts were resuspended in 0^-lml ofSTC repelleted. 
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100 ul of protoplast suspension was mixed with 5-25 ug of the appropriate DNA in 10 ul of STC. 
Protoplasts from the argB strains were mixed with pSal43 DNA (an A. nidulans argB gene carrying 
plasmid) and protoplasts from the argB* strains were mixed with p3SR2 (an A. nidulans amdS gene 
carrying plasmid). The mixture was left at room temperature for 25 min. 0.2 ml of 60% PEG 4000 (BDH 

5 29576), 10 mM CaCI* and 10 mM Tris-HCI pH = 7.5 was added and carefully mixed (twice) and finally 0.85 
ml of the same solution was added and carefully mixed. The mixture was left at room temperature, for 25 
min. t spun at 2500 g for 15 min. and the pellet was resuspended in 2 ml of 1.2 M sorbitol. After one more 
sedimentation the protoplasts were spread on the appropriate plates. Protoplasts from the argB strains 
transformed with pSa!43 were spread on minima! plates (Cove, Biochem.Biophys.Acta 113 (1966) 51-56) 

io with glucose and urea as carbon and nitrogen sources respectively, and containing 1.2 M sorbitol for 
osmotic stabilization. Protoplasts from the argB* strains transformed with p3SR2 were spread on minimal 
plates (Cove, Biochem.Biophys.Acta 113 (1966) 51-56) containing 1.0 M sucrose, pH = 7.0, 10 mM 
acetamide as nitrogen source and 20 mM CsCI to inhibit background growth. After incubation for 4-7 days 
at 37°C spores were picked, suspended in sterile water and spread for single colonies. This procedure was 

re repeated and spores of a single colony after the second reisofation were stored as a defined transformarrt. 



Example 10 

20 Expression of TAKA-amylase in a wild type A. oryzae strain 

pTAKA17 was transformed into A. oryzae IFO 4177 by cotransformation with p3SR2 containing the 
amdS gene from A. niduians as described in example 9. Protoplasts prepared as described were incubated 
with a mixture of equal amounts of pTAKA 17 and p3SR2, approximately 5 ug of each were used. 9 

25 transform ants which could use acetamide as sole nitrogen source were reisolated twice. After growth on 
YPD (Sherman et a!, 1981) for three days culture supematants were analysed by SDS-PAGE. The gels were 
stained with coomassie brilliant blue R. The best transfbrmants produced 10 -20 times more amylase than 
urrtransformed IFO 4177. One tranformant was selected for further studies and grown in a 2 liter Kieler 
fenmentor on 4% soy bean meal and supplied with glucose during growth. The culture was heavily agitated 

30 during fermentation. Under these conditions IFO 4177 gave about 1 g/I and the transformarrt about 12 g/l of 
amylase determined as enzyme activity. Enzyme activity was measured as ability to degrade starch (Cereal 
Chemistry, 16 (1939), 712-723). The starch used was Merck Amylum solubile erg B.6 and the assay was 
performed at pH 4.7 and at 37* C. No external beta-amylase was added. 

35 

Example 11 

Expression of RMP in A. oryzae 

40 p7T7 from example 7 or p778 from example 6 was transformed into IFO-4177 by cotransformation with 
p3SR2 by the procedure described in example 9. Transform ants were selected and reisolated as described 
in example 9. 

Transformants were grown for three days in YPD and supematants were analysed by SDS-PAGE 
followed by Western blotting and EUSA. The supematants from transformants of both p777 and p778 

46 contained from 50 -150 mg/l of protein reacting with RMP antibody. The proteinase was overglycosylated 
compared to the R. miehei produced proteinase. Two forms were seen of which one is presumed to be a 
proform and the other the processed mature proteinase. Two transformants of p778 and three transformants 
of p777 were grown in a fermentor in the same way as the TAKA-amylase transformants described above. 
The two transformants of p778 gave approximately 02. g/l and 0.4 g/l and the three transformants of p777 

so gave approximately 0.5 g/l, 2.4 g/l and 3.3 g/l of RMP determined as milk clotting activity by the Kunltz 
method (Kunltz M., Jour.Gen.PhysioI. 18 (1935), 459-466), assuming that the specific activity of the 
recombinant RMP is the same as that of the Rhizomucor miehei enzyme. (This has later been confirmed). 
SDS-PAGE and SDS-PAGE followed by Western-blotting and EUSA revealed that only one form of RMP 
was present when culturing in a larger scale. The RMP was overglycosylated also under these growth 

ss conditions. The protein amount seen on gels correlated well with the amounts predicted from enzyme 
activity. 

RMP was purified from the culture supernatant by affinity chromatography and size exclusion 
chromatography. 
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The N-terminal sequence of the purified recombinant RMP was determined by use of a Gas Phase 
Sequencer as described by Thim et aJ. (FEBS Lett, 1987 in press). 

Two forms of the recombinant RMP were found indicating that the processing in the N-terminal end was 
heterogeneous Oto form had the N-terminal sequence of: Ala-Asp^Iy-Ser-Vai-Asp-Thr-Pro-Gly-Tyr-and 
the other form had the N-terminal sequence of: Gly-Ser-Val-Asp-Thr-Pro-Gly-Tyr-Tyr-Asp-. Such heteroge- 
2fT E!^iri^ ^ N " terminus h«s also been described for native RMP from Mucor miehei (PaquetD 
et al.Neth.Milk.Da.ry J., 35 (1981). 358-360). As the heterogenous processing of the recombinant RMP 
correlates well with that of native RMP. A. oryzae has according to the present invention been shown to be 
able to process recombinant RMP in the correst region. 

Example 12 

Construction of an Expression unit for the production of prochymosin in A. oryzae 

The construction contains the prochymosin gene immediately preceded by the signal peptide sequence 
from the A. oryzae TAKA-amylase gene under control of the A. oryzae TAKA-amylase promoter The 
construct further contains the terminator from the A. niger glucoamylase gene plus an E. coli replicon ' 

An approximately 430 bp BamHI/Xmal fragment from p285' proC (see exampTei)~and a synthetic 
oligomer of the following sequence 

AATTCCAGCTGCCGCGGCCGAGATCACCAG 

GGTCGACGGCGCCGGCTCTAGTGGTCCTAG 
were inserted into EcoRI-Xmal cut pUCl9 plasmid giving plasmid pToC50a. 

pToCSOa was cut wfth EcoRI-Sacll and the large fragment containing pUCl9 and the 5' part of the 
prochymosin gene (prochymosin*) was isolated. This fragment was ligated with a 0.6 kb EcoRI-Banl 
fragment from pTAKA 17 and the following synthetic oligomer 

GCACCTGCTTTGGC 

GACGAAAC (KFN 280/281) 

After transformation a plasmid pToC51 containing the 5' part of the prochymosin gene (prochymosin') 
fused to the signal sequence from the A. oryzae TAKA-amylase gene (preTAKA) and preceded by 
approximately 500 bp upstream TAKA-amylase sequence was isolated. The construction of pToC51 is 
illustrated in Fig. 15a 

pR26 was cut with Hinfi. treated with the large fragment (Klenow) of DNA polymerase I and the four 
dNTP's and cut with Xmal. A 750 bp fragment containing the 3'end of the prochymosin gene was isolated 
With the purpose of inserting a Hindlll at the 3'end of this fragment pUC9 was cut with Xmal/Hincll and the 
large fragment was ligated to the 750 bp fragment containing the 3'end of the prochymosin gene. 

A 5.6 kb EcoRI-Clal fragment from pTAKA 17 was isolated and ligated with a 2.6 kb Clai-Hindlll 
fragment from the same plasmid plus a 0.7 kb EcoRI-Hindlll fragment from pICAMG/Term (see example 4) 
containing the A. niger glucoamylase gene terminator and polyA site. The resulting plasmid is illustrated in 
Fig. 15b as pToC52. 

pToC 52 was cut with Hindlll and partially with EcoRI and a 6.4 kb fragment was isolated. This was 
ligated with a 0.9 kb EcoRI-Xmal fragment from pToCSl and a 0.7 kb Xmal-Hindlll fragment from pUC9PC 
containing the 3" part of the prochymosin gene ('prochymosin). The resulting plasmid is called P ToC56 and 
is depicted in Fig. 15b. 



Example 13 

Expression of prochymosin in A. oryzae 

ow,i! T °^o W f t ^ Sf0rm ? d int0 > 0f y zae ,FO 4177 or an argB mutant thereof by cotransformation with 
(amdS 9ene> 0r P^ 43 < araB 9 ene >- Transformants which grew on selective media were 
reisolated twice as described in example 9. 
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The transformants were grown for three days in YPD and the prochymosin content in supematants was 
analysed by EUSA on a Western blot after SDS-PAGE. The transformants produced 1 -10 mg/l of a 
prochymosin size immunoreactive protein in the supematants. No other immunoreactive proteins were 
* detected in the supematants. 

5 

Example 14 

Expression of RML in A. oryzae 
10 " * ' 

p787 from example 8 was transformed into IFO-4177 by cotransformation with p3SR2 by the procedure 
described in example 9. Transformants were selected and reisolated as described in example 9. 

Supematants from YPD cultures of the transformants grown for three days were analyzed by SDS- 
PAGE followed by Western blotting and EUSA. The best transformant produced 2 mg/t of a protein the size 
is of the matured RML. The lipase activity in the supematants was assayed as the ability to cleave tributyrin - 
(NOVO method AF 95.1/3-GB). 
. The measurement confirmed that 2 mg/l of active lipase was present in the supematants. 
The features disclosed in the foregoing description, in the claims and/or in the accompanying drawings 
may, both separately and in any combination thereof, be material for realising the invention in diverse forms 
20 thereof. 



Claims 

< 

26 1 . A process for expression of a protein product in Aspergillus oryzae comprising the steps of: 

(a) providing a recombinant DNA cloning vector system capable of integration into the genome of an 
Aspergillus oryzae host in one or more copies and comprising: DNA-sequences encoding functions 
facilitating gene expression; a suitable marker for selection of transformants; and a DNA-sequence encoding 
the desired protein product; 

30 (b) transforming the Aspergillus oryzae host which does not harbour a functional gene for the chosen 

selection marker with the recombinant DNA cloning vector system from step a; and 

(c) culturing the transformed Aspergillus oryzae host in a suitable culture medium. 

2. A process according to claim 1, wherein the DNA-sequence encoding functions facilitating gene 
expression comprises a promoter, transcription initiation sites, and transcription terminator and polyadenyla- 

35 tion functions. 

3. A process according to claim 2, wherein the promoter is preceded by upstream activating 
sequences. 

4. A process according to claim 1, wherein Jhe selection marker is derived from the gene for A. nidulans 
or A. niger argB, A. nidulans trpC, A. nidulans amdS, Neurospora crassae Pyr4 or DHFR. 

40 5. A process according to claim 4, wherein the selection marker is the ArgB gene derived from A. 

nidulans or A. niger or the amdS gene derived from A. nidulans . 

6. A process according to claim 3, wherein the promoter and upstream activating sequences are 

derived from a gene encoding an extracellular or intracellular protein, such as an amylase, a gluco amylase, 

a protease, a lipase, a cellulase or a glycolytic enzyme. 
46 7. A process according to claim 6, wherein the promoter and upstream activating sequences are 

derived from the gene for A. oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, A. niger neutral 

a-amylase, A. niger acid stable or amylase, A . niger glucoamylase or Rhizomucor miehei lipase. 

8. A process according to claim 7, wherein the promoter is the A. oryzae TAKA amylase promoter or 

functional parts thereof. 

so 9. A process according to claim 8, wherein the promoter and upstream activating sequences have the 
following sequence: 

GTCGACGC ATTCCGAATA CGAGGCCTGA TTAATGATTA CATACGCCTC CGGGTAGTAG ACCGAG- 
CAGC CGAGCCAGTT CAGCGCCTAA AACGCCTTAT ACAATTAAGC AGTTAAAGAA GTTAGAATCT ACG- 
CTTAAAA AGCTACTTAA AAATCGATCT CGCAGTCCCG ATTCGCCTAT CAAAACCAGT TTAAATCAAC 
55 TGATTAAAGG TGCCGAACGA GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC 
A ATAT CAGGC CGCGCACGAA AGGCAACTTA AAAAGCGAAA GCGCTCTACT AAACAGATTA 
CTTTTGAAAA AGGCACATCA GTATTTAAAG CCCGAATCCT TATTAAGCGC CGAAATCAGG 
CAGATAAAGC CATACAGGCA GATAGACCTC TACCTATTAA ATCGGCTTCT AGGCGCGCTC CATC- 
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TAAATG TTCTGGCT6T GGTGTACAGG GGCATAAAAT TACGCACTAC CCGAATCGAT AGAACTACTC 
ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT TTTAAATTTT TATAT6GCGQ 
GTGGTGGGCA ACTCGCTTGC GCGGGCAACT CGCTTACCGA TTACGTTAGG GCTGATATTT ACGT- 
GAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA ACCCCGQAAG TCAACAGCAT CCAAGCCCAA 
GTCCTTCACG GAGAAACCCC AGCGTCCACA TCACGAGCGA AGGACCACCT CTAGGCATCG GACG- 
CACCAT CCAATTAGAA GCAGCAAAGC GAAACAGCCC AAGAAAAAGG TCGGCCCGTC GGCCTTTTCT 
GCAACGCTGA TCACGGGCAG CGATCCAACC AACACCCTCC AGAGTGACTA GGGGCGGAAA TTTAAAG- 
GGA TTAATTTCCA CTCAACCACA AATCACAGTC GTCCCCGGTA TTGTCCTGCA GAATGCAATT 
TAAACTCTTC TGCGAATCGC TTGGATTCCC CGCCCCTAGT CGTAGAGCTT AAAGTATGTC 
CCTTGTCGAT GCGATGTATC ACAACATATA AATACTAGCA AGGGATGCCA TGCTTGGAGG ATAG- 
CAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG AACAATAAAC CCCACAG or a functionally equiv- 
alent nucleotide sequence. 

10. A process according to claim 8. wherein the promoter and upstream activating sequences have the 
following sequence: 

AGATCTGCCC TTATAAATCT CCTAGTCTGA TCGTCGACGC ATTCCGAATA CGAGGCCTGA TTAAT- 
6ATTA CATACGCCTC CGGGTAGTAG ACCGAGCAGC CGAGCCAGTT CAGCGCCTAA AACGCCTTAT 
ACAATTAAGC AGTTAAAGAA GTTAGAATCT ACGCTTAAAA AGCTACTTAA AAATCGATCT 
CGCAGTCCCG ATTCGCCTAT CAAAACCAGT TTAAATCAAC T6ATTAAAGG TGCCGAACGA 
GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC AATATCAGGC CGCGCACGAA AGGCAAC- 
TTA AAAAGCGAAA GCGCTCTACT AAACAGATTA CTTTTGAAAA AGGCACATCA GTATTTAAAG 
CCCGAATCCT TATTAAGCGC CGAAATCAGG CAGATAAAGC CATACAGGCA GATAGACCTC TACCTAT- 
TAA ATCGGCTTCT AGGCGCGCTC CATCTAAATG TTCTGGCTGT GGTGTACAGG GGCATAAAAT TACG- 
CACTAC CCGAATCGAT AGAACTACTC ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT 
TTTAAATTTT TATATGGCGG GTGGTGGGCA ACTCGCTTGC GCGGGCAACT CGCTTACCGA TTACGT- 
TAGG GCTGATATTT ACGTGAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA ACCCCGGAAG 
TCAACAGCAT CCAAGCCCAA GTCCTTCACG GAGAAACCCC AGCGTCCACA TCACGAGCGA AGGAC- 
CACCT CTAGGCATC G 6A CGCACCAT CCAATTAGAA GCAGCAAAGC GAAACAGCCC AAGAAAAAGG 
TCGGCCCGTC GGCCTTTTCT GCAACGCTGA TCACGGGCAG CGATCCAACC AACACCCTCC AGAGT- 
GACTA GGGGCGGAAA TTTAAAGG6A TTAATTTCCA CTCAACCACA AATCACAGTC GTCCCCGGTA 
TTGTCCTGCA GAATGCAATT TAAACTCTTC TGCGAATCGC TTGGATTCCC CGCCCCTAGT 
CGTAGAGCTT AAAGTATGTC CCTTGTCGAT GCGATGTATC ACAACATATA AATACTAGCA AGGGATG- 
CCA TGCTTGGAGG ATAGCAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG AACAATAAAC 
CCCACAGAAG GCATTT or a functionally equivalent nucleotide sequence. 

11. A process according to claim 10, wherein the sequence in claim 10 is preceded by the 1.05 kb 
unsequenced upstream region from position 0 to 1.05 in plasmid pTAKA 17. 

12. A process according to claim 1, wherein the vector system further comprises a preregion providing 
for secretion of the expressed product into the culture medium. 

13. A process according to claim 12, wherein the preregion is derived from a glucoamylase or an 
amylase gene from an Aspergillus species, an amylase gene from a Bacillus species, a lipase or proteinase 
gene from Rhizomucor miehei. the gene for the a-factor from S. cerevisiae or the calf prochymosin gene. 

14. A process according to claim 13. wherein the preregion is derived from the gene for A. oryzae 
TAKA amylase, A. niger neutral o-amyiase, A. niger acid-stable o-amylase. B. licheniformis a-amylase. the 
maltogenic amylase from Bacillus NCIB 11837, B. stearothermophilusa -amvlase or B. licheniformissubtilisin. 

15. A process according to claim 14, wherein the preregion is the TAKA-amylase preregion with the 
following sequence 

ATGATGGTCGCGTGGTGGTCTCTATTTCTGTACGGCCTTCAGGTCGCGGCACCT 
MetMetValAlaTrpTrpSerLeuPheLeuTyrGlyLeuGlnValAlaAlaPro 

GCTTTGGCT 
AlaLeuAla 

16. A process according to claim 1, wherein the vector system comprises two vectors, where on 
contains the selection marker and the other contains DNA-sequences encoding functions facilitating gene 
expression and a ONA-sequence encoding the desired protein product 

17. A process for production of a protein product in Aspergillus oryzae, wherein an Aspergillus oryzae 
strain being transformed with a recombinant DNA cloning vector system as described in claim 1 is cultured 
in a suitable culture medium and the product is recovered from the culture medium. 

18. A promoter suitable for expression of a protein product in Aspergillus, characterized by being the 
TAKA-amylase promoter or functional parts thereof optionally preceded by upstream activating sequences. 
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sequel "* sequences accordin 9 *> claim 18 having the following 

AGATCTGCCC TTATAAATCT CCTAQTCTQA TCQTCGACGC ATTCCGAATA CGAGGCCTGA TTAAT 

5 ACAA^^GC° G< AG^A^Ar AA AG1 ft3r ACCGAGCAGC CGAGCCAGT^ ^SSS?SS^5^ 
S ACAATTAAGC AGTTAAAGAA GTTAGAATCT ACGCTTAAAA AGCTACTTAA AAATCQATrT 
CGCAGTCCCG ATTCGCCTAT CAAAACCAGT TTAAATCAAC TGATT^AGG T^Jg^cS 
%I AT £?IZ ATATAACAAT ATTAAAGCAT TAATTAGAGC AATATC AGG C CG^CACG ^ AG^C^ 
ScGAA^^ T^^ C J^ CT M*™**™ CTTTTGAAAA AGGCACATCA GTATFT^G 
CCCGAATCCT TATTAAGCGC CGAAATCAGG CAGATAAAGC CATACAGGCA GATAGACCTC TACCTA-r! 

CACTAC CCGAATCGAT AGAACTACTC ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT 
TTTAAATTTT TATATGGCGG GTGGTGGGCA ACTCGCTTGC GCGGG^AACT TO^C^^CGtT 
TAGG GCTGATATTT ACGTGAAAAT CGTCAAGGGA TGCAAGACCA ^GTaISaA ACCcSg^G 
JESS*?* 1 " CCAAGCCCAA GTCCTTCACG GAGAAACCCC AGCGTCCACA TcAC^GCG^ AG^cf- 
" 5 A ?^^T A ° GCATCG ^aGACCAT CCAATTAGAA GCAGCAAAGC GAMCAGCTO A^A^GG 
SSf C ?J2™ CCTTTTCT GCAACGCTG A TCACGGGCAG CGATCCAACC AACACCCTCCAGUM^ 
??SL GGGGCGGAAA TTTAAAGGGA TTAATTTCCA CTCAACCACA AATCACAGTC GtScCGGT^ 
TTGTCCTGCA GAATGCAATT TAAACTCTTC TGCGAATCGC TTGGATTCCC r.krrrrTAri 

. CcT SKE^^ CCTTGTCGAT QCGATOTATC^CAm^A^AGCAAGG^ 
^J 6 ^??^ 00 ATAGCAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG AACAAtX>AC 
CCCACAGAAG GCATTT or a functionally equivalent nucleotide sequence aacaataaac 

a ^^'n A o^. 0m0t8r 811(1 UPStr ! Sm acBva «"9 sequences according to claim 19. wherein the upstream 
Smtd pTA^A e ?7 6S ^ Y *• 1,05 * Unsequenced "P 8 *"*™ from position 0 toTSTn 



plasmid pTAKA 17. 

^ sequent Pr ° m0ter ^ upstream activating sequences according to claim 19 having the following 

^l^ A ^?„*I TCCGAATA CGAGG CCTGA TTAATGATTA CATACGCCTC CGGGTAGTAG ACCGAG- 
^ C A ? GAQCCAGTT G AGCGCCTAA AACGCCTTAT ACAATTAAGC AGTTAAAGAA G^AGAA"rCT ACG- 
SHi^ AGCTACTTAA AAATCGATCT CGCAGTCCCG ATTCGCCTAT CAAAACCAOT TT^TCA^C 
252i!X5 TGCCGAACGA GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC 
AATATCAGGC CGCGCACGAA AGGCAACTTA AAAAGCGAAA GCGCTCTACT AAACAGATTA 
SIX^^ AGGCACATCA GTATTTAAAG CCCGAATCCT TATTAAGCGC CGAaC'TCAGG 
SSSS^SSr CATACAGGCA QATAGACCTC TACCTATTAA ATCGGCTTCT AGGCGCGCTC CATC- 
TAAATG TTCTGGCTGT GGTGTACAGG GGCATAAAAT TACGCACTAC CCGAATCGAT AGAACTACTC 
35 ATTTTTATAT AGAAGTCAGA ATTCATAGTG TTTTGATCAT TTTAAATTTT TATATGGCGG 
G I GG ; GGGCA ACTCGCTTGC GCGGGCAACT CGCTTACCGA ^TTACgVtA^^TGA^TTT^ A«^ 
GAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA ACCCCGGAAG TCAACAGCAT CCAAGCCCAA 
rISIT CA S G A ^ GAAACCCC AGCGTCCACA TCACGAGCGA AGGACCaSct CTAG^CATcf ofct 
^^IS?™ 11 * 0 ** QCAGCAAAGC GAAACAGCCC AAGAAAAAGG TCGGCCCGTC GGCCTTTTCT 
40 rrf™™^ 30 ^ CGATCCAAC C AACACCCTCC AGAGTGACTA GGGGCGGAAA TTTAAAG^ 
GGA TTAATTTCCA CTCAACCACA AATCACAGTC GTCCCCGGTA TTGTCCTGCA GAATGCAATT 
TAAACTCTTC TGCGAATCGC TTGGATTCCC CGCCCCTAGT CGTAGAGCTT AAAGT^tc£ 
?^rZl^IJ 0 £ ATGTATC ACAACATATA AATACTAGCA AGGGATGCCA TGCTTGGAGG AtS 
. «™^% C £^ TCAAGCTCr CCCTTCTCTQ CCCACAG - 3 ™, y A eq AQ : 
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-1167 -1157 -1147 -1137 -1127 

117 6 AGATCTGCCC TTATAAATCT CCTAGTCTGA TCGTCGACGC ATTCCGAATA 

1126 CGAGGCCTGA TTAATGATTA CATACGCCTC CGGGTAGTAG ACCGAGCAGC 

107 6 CGAGCCAGTT CAGCGCCTAA AACGCCTTAT ACAATTAAGC AGTTAAAGAA 

102 6 GTTAGAATC.T ACGCTTAAAA AGCTACTTAA AAATCGATCT CGCAGTCCCG 

-97 6 ATTCGCCTAT CAAAACC AG T TTAAATCAAC TGATTAAAGG TGCCGAACGA 

-92 6 GCTATAAATG ATATAACAAT ATTAAAGCAT TAATTAGAGC AATATCAGGC 

-87 6 CGCGCACGAA AGGCAACTTA AAAAGCGAAA GCGCTCTACT AAACAGATTA 

-826 CTTTTGAAAA AGGCACATCA GTATTTAAAG CCCGAATCCT TATTAAGCGC 

-776 CGAAATCAGG CAGATAAAGC CATACAGGCA GATAGACCTC TACCTATTAA 

-72 6 ATCGGCTTCT AGGCGCGCTC CATCTAAATG TTCTGGCTGT GGTGTACAGG 

-67 6 GGCATAAAAT TACGCACTAC CCGAATCGAT AG AACT AC TC ATTTTTATAT 

-626 AGAAGTCA GA ATTC ATAGTG TTTTGATCAT TTTAAATTTT TATATGGCGG 

EcoRI 

-57 6 GTGGTGGGCA ACTCGCTTGC GCGGGCAACT CGCTTACCGA TTACGTTAGG 
-52 6 GCTGATATTT ACGTGAAAAT CGTCAAGGGA TGCAAGACCA AAGTAGTAAA 
-4 7 6 ACCC 7SGAAG TCAACAGCAT CCAAGCCCAA GTCCTTCACG GAGAAACCCC 
-4 2 6 AGCGTC ~.-.CA TCACGAGCGA AGGACCACCT CTAGGCATCG GACGCACCAT 
-2 7 6 CCAATTA3AA GCAGCAAAGC GAAACAGCCC AAGAAAAAGG TCGGCCCGTC 
-326 GGCCTTTTCT GCAACGCTGA TCACGGGCAG CGATCCAACC AACACCCTCC 
-27 6 AGAGTGACTA GGGGCGGAAA TTTAAAGGGA TTAATTTCCA CTCAACCACA 
-226 AATCACAGTC GTCCCCGGTA TTGTCCTGCA GAATGCAATT TAAACTCTTC 
-17 6 TGCGAATCGC TTGGATTCCC CGCCCCTAGT CGTAGAGCTT AAAGTATGTC 
-12 6 CCTTGTCGAT GCGATGTATC ACAACATATA AATACTAGCA AGGGATGCCA 

-7 6 TGC7TGGAGG ATAGCAACCG ACAACATCAC ATCAAGCTCT CCCTTCTCTG 

-26 AACAATAAAC CCCACAGAAG GCATTT 

4 14 "24 34 44 54 

1 I I I | | 

ATGATGGTCGCGTGGTGGTCTCTATTTCTGTACGGCCTTCAGGTCGCGGCACCTGCTTTG 
MetMetValAlaTrpTrpSerLeuPheLeuTyrGlyLeuGlnValAlaAlaProAlaLeu 

64 74 84 94 104 114 

I I I I | | 

GCTGCAACGCCTGCGGACTGGCGATCGCAATCCATTTATTTCCTTCTCACGGATCGATTT 
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AAC ACT GTT CAC GTC AAC CTC TAC AAG CAA GGC TTG ATC TCT TCT CCT 7DB 

ASM THR VAL HIS VAL ASN LEU TYR LYS GLN GLY LEU ILE SER SER PRO 167 

CTT TTC TCG GTC TAC ATG AAC ACT AAC AGC GGC ACT GGA GAG GTC GTC 7SG 

LEU PHE SER VAL TYR MET ASN THR ASN SER GLY THR GLY GLU VAL VAL 183 

TTT GGT GGA GTC AAC AAC ACG CTT CTC GGC GGC GAC ATT GCC TAC ACG ana 

PHE GLY GLY VAL ASN ASN THR LEU LEU GLY GLY A%P ILE IS 7W THR 199 

55£ SIT" 515 CG J TAT GGT GGT TAT TAC TTC TGG GAC GCA CCC GTC 852 

ASP VAL MET SER ARG TYR GLY GLY TYR TYR PHE TRP ASP ALA PRO VAL 215 

ACA GGT ATC ACC GTC GAT GGA TCT GCT GCT GTC AGG TTC TCG AGA CCC 900 

THR GLY ILE THR VAL ASP GLY SER ALA ALA VAL ARG PHE SER ARG PRO 231 

CAA GCA TTC ACC ATC GAT ACT GGC ACC AAC TTT TTC ATT' ATG CCC TCA 948 

GLN ALA PHE THR ILE ASP THR GLY THR ASN PHE PHE ili MET PRO ZeR 247 

52£ G Jr° ?f I 111 *** ATT GTC M GCA GCT CTC CCT GAT GCC ACT GAA 996 

SER ALA ALA SER LYS ILE VAL LYS ALA ALA LEU PRO ASP ALA THR GLU 263 

ACC CAG CAG GGC TGG GTT GTT CCT TGC GCT AGC TAC CAG AAC TCC AAG 1D44 

THR GLN GLN GLY TRP VAL VAL PRO CYS ALA SER TYR GLN ASN SER LYS 2 V 9 

TCG ACT ATC AGC ATC GTC ATG CAA AAG TCC GGC TCA AGC AGT GAC ACT 1092 

SER THR ILE SER ILE VAL MET GLN LYS SER GLY SER SER SER aTp THR 295 

ill £££ m T - CG GTT CCT GTC AGC AAA ATG CTT CTT CCA GTC GAC CAA 1140 

ILE GLU ILE SER VAL PRO VAL SER LYS MET LEU LEU PRO VAL ASP GLN 311 

TCG AAC GAG ACT TGC ATG TTT ATC ATT CTT CCC GAC GGT GGT AAC CAG 1188 

SER ASN GLU THR CYS MET PHE ILE ILE LEU PRO ASP GLY GLY ASN GLN 327 

Z2£ t T I GTT GGC AAC TTG TTC CTG CGC TTC TTT GTC AAT GTT TAC GAC 1236 

TYR ILE VAL GLY ASN LEU PHE LEU ARG PHE PHE VAL ASN VAL TYR ASP 343 

111 %Iy HI HZ 2£ S?S 12 G Jr A CCT TTG GCC TCG GCT TAT GM 128 4 

PHE GLY ASN ASN ARG ILE GLY PHE ALA PRO LEU ALA SER ALA TYR GLU 359 

ASN GLU TERM AGGGGCACCAATTCTTCTTTAGCTGCTC ^^ 1343 



TATACTCTTTATAACCTTTATTTCTCACTTTTTAACTGTATTCCAATACATTATTTCCT 
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PIG. 12 



"1 1 

ATCAGAATC ATG GTT CTC AAG CAG CGT 6CA AAC TAT CTG GGC TTT CTG ATT GTA TTC TTC 51 

MET VAL LEU LYS GLN ARC ALA ASN TYR LEU GLY PHE LEU ILE VAL PHE PHE -78 

ACG GCG TTC CTG GTC GAA GCC GTG CCA ATC AAG AGA CAA TCA AAC AGC ACG'GTG GAT ACT 111 

THR ALA PHE LEU VAL GLU ALA VAL PRO ILE LYS ARC GLN SER ASN SER THR VAL ASP SER -58 

CTG CCA CCC CTC ATC CCC TCT CGA ACC TCG GCA CCT TCA TCA TCA CCA AGC ACA ACC GAC 171 

LEU PRO PRO LEU ILE PRO SER AR G THR SER ALA PRO SER SER SER PRO SER THR THR ASP -38 

CCT GAA GCT CCA GCC ATG AST CGC AAT GGA CCG CTG CCC TCG GAT GTA GAG ACT AAA TAT 231 

PRO GLU ALA PRO ALA HET SER ARG ASN GLY PRO LEU PRO SER ASP VAL GLU THR LYS TYR -18 

GGC ATG GCT TTG AAT GCT ACT TCC TAT CCG GAT TCT GTG GTC CAA GCA ATG AGC ATT GAT 291 

GLY HET ALA LEU ASN ALA THR SER TYR PRO ASP SER VAL VAL GLN ALA HETjSER ILE ASP 3 

GGT GGT ATC CGC GCT GCG ACC TCG CAA GAA ATC AAT GAA TTG ACT TAT TAC ACT ACA CTA 351 

GLY GLY ILE ARG ALA ALA THR SER GLN GLU ILE ASN GLU LEU THR TYR TYR THR THR LEU 23 

TCT GCC AAC TCG TAC TGC CGC ACT GTC ATT CCT GGA GCT ACC TGG GAC TGT ATC CAC TGT 411 

SER ALA ASN SER TYR CYS ARG THR VAL ILE PRO GLY ALA THR TRP ASP CYS ILE HIS CYS 43 

GAT GCA ACG GAG GAT CTC AAG ATT ATC AAG ACT TGG AGC ACG CTC ATC TAT GAT ACA AAT 471 

ASP ALA THR GLU ASP LEU LYS ILE ILE LYS THR TRP SER THR LEU ILE TYR ASP THR ASN 63 

GCA ATG GTT GCA CGT GGT GAC AGC GAA AAA ACT ATC TAT ATC GTT TTC CGA GGT TCG AGC 531 

ALA HET VAL ALA-ARG . GLY -ASP SER- GLU LYS THR ILE TYR ILE VAL PHE ARG GLY SER SER 83 : _-. 

TCT ATC CGC AAC TGG ATT GCT GAT CTC ACC TTT GTG CCA GTT TCA TAT CCT CCG GTC AGT 591 

SER ILE ARG ASN TRP ILE ALA ASP LEU THR PHE VAL PRO VAL SER TYR PRO PRO VAL SER 103 

GGT ACA AAA GTA CAC AAG GGA TTC CTG GAC AGT TAC GGG GAA GTT CAA AAC GAG CTT GTT 651 

GLY THR LYS VAL HIS LYS GLY PHE LEU ASP SER TYR GLY GLU VAL GLN ASN GLU LEU VAL 123 

GCT ACT GTT CTT GAT CAA TTC AAG CAA TAT CCA AGC TAC AAG GTT GCT GTT ACA GGT CAC 711 

ALA THR VAL LEU ASP GLN PHE LYS GLN TYR PRO SER TYR LYS VAL ALA VAL THR GLY HIS 143 

TCA CTC GGT GGT GCT ACT GCG TTG CTT TGC GCC CTG GGT CTC TAT CAA CGA GAA GAA GGA 771 

SER LEU GLY GLY ALA THR ALA LEU LEU CYS ALA LEU GLY LEU TYR GLN ARG GLU GLU GLY 163 

CTC TCA TCC AGC AAC TTG TTC CTT TAC ACT CAA GGT CAA CCA CGC GTA GGC GAC CCT GCC 831 

LEU SER SER SER ASN LEU PHE LEU TYR THR GLN GLY GLN PRO ARG VAL GLY ASP PRO ALA 183 

TTT GCC AAC TAC GTT GTT AGC ACC GGC ATT CCT TAC AGG CGC ACG GTC AAT GAA CGA GAT 831 

PHE ALA ASN TYR VAL VAL SER THR GLY ILE PRO TYR ARG ARG THR VAL ASN GLU ARG ASP 203 

ATC GTT CCT CAT CTT CCA CCT GCT GCT TTT GGT TTT CTC CAC GCT GGC GAG GAG TAT TGG 351 

ILE VAL PRO HIS LEU PRO PRO ALA ALA PHE GLY PHE LEU HIS ALA GLY GLU GLU TYR TRP 223 

ATT ACT GAC AAT AGC CCA GAG ACT GTT CAG GTC TGC ACA AGC GAT CTG GAA ACC TCT GAT 1011 

ILE THR ASP ASN SER PRO GLU THR VAL GLN VAL CYS THR SER ASP LEU GLU THR SER ASP 243 

TGC TCT AAC AGC ATT GTT CCC TTC ACA AGT GTT CTT GAC CAT CTC TCG TAC TTT GGT ATC 1071 

CYS SER ASN SER ILE VAL PRO PHE THR SER VAL LEU ASP HIS LEU SER TYR PHE GLY ILE 263 

AAC ACA GGC CTC TGT ACT TAA GAAATACCAGTTATACGATATGTAGGAAGTAGTATTTTTTAGGAAGAGATT 1131 
ASN THR GLY LEU CYS THR TERM . 

TATATGTATTAAACAAATATATATATATATACCGCTGCGCGAGAACCTGTATT POL YA 
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